On Using Incremental Profiling for the Performance Analysis of Shared Memory Parallel Applications
نویسندگان
چکیده
Profiling is often the method of choice for performance analysis of parallel applications due to its low overhead and easily comprehensible results. However, a disadvantage of profiling is the loss of temporal information that makes it impossible to causally relate performance phenomena to events that happened prior or later during execution. We investigate techniques to add temporal dimension to profiling data by incrementally capturing profiles during the runtime of the application and discuss the insights that can be gained from this type of performance data. The context in which we explore these ideas is an existing profiling tool for OpenMP applications.
منابع مشابه
Performance Analysis of Shared-Memory Parallel Applications Using Performance Properties
Tuning parallel code can be a time-consuming and difficult task. We present our approach to automate the performance analysis of OpenMP applications that is based on the notion of performance properties. Properties are formally specified in the APART specification language (ASL) with respect to a specific data model. We describe a data model for summary (profiling) data of OpenMP applications a...
متن کاملMessage-passing Over Shared Memory for the SECK Programming Environment
Message-passing is a representative communication model in today’s parallel and distributed programming, and should be efficiently supported even for multithreaded-only parallel programs. This papers describes the design and implementation of a communication mechanism which emulates message passing on top of shared memory for multithreaded applications. The mechanism is implemented in the DECK ...
متن کاملShared Memory Computing on Networks of Workstations
TreadMarks supports parallel computing on networks of workstations by providing the application with a shared memory abstraction Shared memory facilitates the transition from sequential to parallel programs After identifying possible sources of parallelism in the code most of the data structures can be retained without change and only synchronization needs to be added to achieve a correct share...
متن کاملTreadMarks Shared Memory Computing on Networks of Workstations
TreadMarks supports parallel computing on networks of workstations by providing the application with a shared memory abstraction Shared memory facilitates the transition from sequential to parallel programs After identifying possible sources of parallelism in the code most of the data structures can be retained without change and only synchronization needs to be added to achieve a correct share...
متن کاملThreadMarks: Shared Memory Computing on Networks of Workstations
TreadMarks supports parallel computing on networks of workstations by providing the application with a shared memory abstraction. Shared memory facilitates the transition from sequential to parallel programs. After identifying possible sources of parallelism in the code, most of the data structures can be retained without change, and only synchronization needs to be added to achieve a correct s...
متن کامل